Search CORE

35 research outputs found

Stocator: A High Performance Object Store Connector for Spark

Author: Vernik Gil
Factor Michael
Kolodner Elliot K.
Michiardi Pietro
Ofer Effi
Pace Francesco
Publication venue
Publication date: 08/08/2017
Field of study

We present Stocator, a high performance object store connector for Apache Spark, that takes advantage of object store semantics. Previous connectors have assumed file system semantics, in particular, achieving fault tolerance and allowing speculative execution by creating temporary files to avoid interference between worker threads executing the same task and then renaming these files. Rename is not a native object store operation; not only is it not atomic, but it is implemented using a costly copy operation and a delete. Instead our connector leverages the inherent atomicity of object creation, and by avoiding the rename paradigm it greatly decreases the number of operations on the object store as well as enabling a much simpler approach to dealing with the eventually consistent semantics typical of object stores. We have implemented Stocator and shared it in open source. Performance testing shows that it is as much as 18 times faster for write intensive workloads and performs as much as 30 times fewer operations on the object store than the legacy Hadoop connectors, reducing costs both for the client and the object storage service provider

arXiv.org e-Print Archive

FigShare

Establishing local temporal heap safety properties with applications to compile-time memory management

Author: Kolodner Elliot K.
Sagiv Mooly
Shaham Ran
Yahav Eran
Publication venue: Elsevier B.V.
Publication date: 31/10/2005
Field of study

AbstractWe present a framework for statically reasoning about temporal heap safety properties. We focus on local temporal heap safety properties, in which the verification process may be performed for a program object independently of other program objects. We apply our framework to produce new conservative static algorithms for compile-time memory management, which prove for certain program points that a memory object or a heap reference will not be needed further. These algorithms can be used for reducing space consumption of Java programs. We have implemented a prototype of our framework, and used it to verify compile-time memory management properties for several small, but interesting example programs, including JavaCard programs

Elsevier - Publisher Connector

AIOps for a Cloud Object Storage Service

Author: Barabash Katherine
Garion Shelly
Kolodner Elliot K.
Kugler Mike
Levin Anna
Lorenz Dean H.
McShane Niall
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 06/05/2020
Field of study

With the growing reliance on the ubiquitous availability of IT systems and services, these systems become more global, scaled, and complex to operate. To maintain business viability, IT service providers must put in place reliable and cost efficient operations support. Artificial Intelligence for IT Operations (AIOps) is a promising technology for alleviating operational complexity of IT systems and services. AIOps platforms utilize big data, machine learning and other advanced analytics technologies to enhance IT operations with proactive actionable dynamic insight. In this paper we share our experience applying the AIOps approach to a production cloud object storage service to get actionable insights into system's behavior and health. We describe a real-life production cloud scale service and its operational data, present the AIOps platform we have created, and show how it has helped us resolving operational pain points.Comment: 5 page

arXiv.org e-Print Archive

Crossref

Parallel Copying Garbage Collection using Delayed Allocation

Author: Elliot K. Kolodner
Erez Petrank
Publication venue
Publication date: 01/01/1999
Field of study

We present a new approach to parallel copying garbage collection on symmetric multiprocessor (SMP) machines appropriate for Java and other object-oriented languages. Parallel, in this setting, means that the collector runs in several parallel threads. Our collecto

CiteSeerX

A Note on the Implementation of Replication-Based Garbage Collection for Multithreaded Applications and Multiprocessor Environments

Author: Alain Azagury Elliot
Elliot K. Kolodner
Erez Petrank
Publication venue
Publication date: 01/01/1998
Field of study

Replication-based incremental garbage collection [5, 4] is one of the more appealing concurrent garbage collection algorithms known today. It allows continuous operation of the application (the mutator) with very short pauses for garbage collection. There is a growing need for such garbage collectors suitable for a multithreaded environments such as the Java Virtual Machine

CiteSeerX

Combining Card Marking with Remembered Sets: How to Save Scanning Time

Author: Alain Azagury Elliot
Elliot K. Kolodner
Erez Petrank
Zvi Yehudai
Publication venue
Publication date: 01/01/1998
Field of study

We consider the combination of card marking with remembered sets for generational garbage collection as suggested by Hosking and Hudson [3]. When more than two generations are used, a naive implementation may cause excessive and wasteful scanning of the cards and thus increase the collection time. We offer a simple data structure and a corresponding algorithm to keep track of which cards need be scanned for which generation. We then extend these ideas for the Train Algorithm of [4]. Here, the solution is more involved, and allows tracking of which card should be scanned for which carcollection in the train

CiteSeerX

A Generational On-the-fly Garbage Collector for Java

Author: Elliot K. Kolodner
Erez Petrank
Tamar Domani
Publication venue
Publication date: 01/01/2000
Field of study

An on-the-fly garbage collector does not stop the program threads to perform the collection. Instead, the collector executes in a separate thread (or process) in parallel to the program. On-the-fly collectors are useful for multithreaded applications running on multiprocessor servers, where it is important to fully utilize all processors and provide even response time, especially for systems for which stopping the threads is a costly operation. In this work, we report on the incorporation of generations into an on-the-fly garbage collector. The incorporation is non-trivial since an on-the-y collector avoids explicit synchronization with the program threads. To the best of our knowledge this incorporation has not been tried before. We have implemented the collector for a prototype Java Virtual Machine on AIX, and measured its performance on a 4-way multiprocessor. As for other generational collectors, an on-the-fly generational collector has the potential for reducing the overall running time and working set of an application by concentrating collection efforts on the young objects. However, in contrast to other generational collectors, on-the-fly collectors do not move the objects; thus, there is no segregation between the old and the young objects. Furthermore, on-the-fly collectors do not stop the threads, so there is no extra benefit for the short pauses obtained by generational collection. Nevertheless, comparing our on-the-fly collector with and without generations, it turns out that the generational collector performs better for most applications. The best reduction in overall running time for the benchmarks we measured was 25%. However, there were some benchmarks for which it had no e ect and one for which the overall running time increased by 4%

CiteSeerX

Crossref